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ABSTRACT 

This paper examines whether failure to control for 
family background, aptitude, high school quality, high school 
curriculum, and community characteristics leads to biP.s in estimates 
of the economic return resulting from postsecondary education. High 
school specific fixed effects were used to control for all observed 
and unobserved characteristics common to students from a given high 
school and an instrumental variables procedure was used to deal with 
measurement error in reported education. The estimation was conducted 
using panel data on several persons from each of a large number of 
high schools sampled in the National Longitudinal Survey of the High 
School Class of 1972. Controlling for family background and aptitude 
and achievement measures led to a substantial reduction in estimates 
of the rate of return to education. Measurement error was not 
responsible for the reduction. However, the use of fixed effects to 
control for high school and community characteristics had only a 
modest eflr.ct on estimates of the rate of return to education, and 
controlling for high school curriculum made littDe difference. It is 
noted that this is an important finding because few data sets permit 
researchers to control for these factors when estimating the effect 
of education on wages. Included is an appendix of means and standard 
deviations. Contains 12 references. (Author) 
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CoBtroIUng for Penonal Cbaractcrlitkf, 
School aiBd Comniuilty Chinictcrlsticii and High School Cuirlculum 
When Eitlfflatlng the Return to Education 



ABSTRikCT 



This paiw studies wliedier failure to contrd for Cunily background, aptitude, high school quality, hi^ 
school curriculum, and cor-niunity characteristics leads to bias in estimates of the return to post secCi ^Iary 
education. High school specific fixed effects are used to control for all observed and unobserved characteristics 
common to students firom a given high school and an instrumental variables procedure is used to deal with 
measurement error in reported txlucation. The estimation is conducted using panel data on several persons firom 
each of a large number of high schools sampled in the National Longitudinal Survey of the High School Class of 
1972. 

Controlling for family btv^^ground and aptitude and achievement measures leads to a substantial 
reduction in estimates of the rate of return to education. Measuremeitt emv is not responsible f(v the reduction. 
However, the use of fixed effects ito control for higi school and conu^iunity characteristics has only a modest 
effect on estimates of the rate of n\uim to educadon, vid controlling f(x high school curriculum makes little 
difference. This is an important and reassuring finding because few data sets permit one to control for these 
factors when estimadng the effect of education on wages. 
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INTRODUCTION 

Hundreds of studies have sbov^n Uut more-educated workers receive higher wages and earnings than 
le8s*educated woikers.' The typical esti^iate is that college graduates earn about 30 percent more than high 
school graduates. Tliis earnings gap. though varying over time, has always been substantial.' Butthetnoad 
ayeemoit that moie-educated wMkers earn more than less-educated wofkers is accompanied by disagreement on 
the extern to which the difference iii earnings is due to the difference in education. Put of the earnings 
advantage of educated workers may arise from differences in ability, achievement in primary and secondary 
schod. quality of curriculum in primary and secondary school, ot school and community characteristics that 
happen to affect botli wages and years of education.' 

There is in fisct an extensive literature on the effects of Dsunily background and ability on estLnates of 
the return to education. Most studies have found that controlling for fsmily background and ability reduces 
estimates of the rate of return to educatkm. (See Orlliches (1979) and Siebert (1985) for surveys.) However, 
some studies have paid attention to the £act that mismeasurement of education becomes a mom serious problem 
when one controls for ability or family background; these studies fmd somewhat smaller levels of bias and in 
some cases obtain higher estimates of ti« kVhh to education. 

This paper examines whether failure to control for school quality, secondary school curriculum, and 
community characteristics leads to bias in estimates of the return to postsecondary education. Given the interest 
that college admissions committees show in this information, one might suspect that diese variables are 
correlated with postsecondary education. Failure to control for them will lead to bias if they affect the wage 



* Siebert (1985) and Willis (1986) provide recent surveys of the link between education and 
earnings. 

^ Recent research by Muiphy and Welch (1988) suggests that the college differential has increased 
substantially during the 1980s. 

' There are two main channels through which the spurious correlation between education and 
wages might arise. First, family background, primary and secondary school quality, and ability might 
affect the wage level independently of postsecondary schooUng. Second, family background, ability, 
and primary and secondaiy school characteristics may affect the production fiuK^tion of human capital. 
Students who are more able, from better family backgrounds, or from better schools may choose more 
postsecondary' education than lf.:s advantaged because they receive a larger payolf to a yc^ar in 
college. In this case, a regression estimate of the difference in earnings of a high school graduate and 
a college graduate will exceed the gain in earnings thai; a typical high school graduate would nv^lve if 
he or she had chosen college. Sec Sicbcit (1985), WilUs (1986), and Qriliches (1977) for discussions 
of these issues. 

1 
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level or the return to education. However, little is known about the question, in part because most of the data 
sets that have been used to study the returns to education such as the Current Population Survey, the Decennial 
Census, the Panel Study of Income Dynamics, and the National Longitudinal Surveys of Labor Market 
Experience contain relatively little infonnatk)n about school cuiriculum and the community/ Furthermore, it is 
hard to enviskm a data ibi that would ctmtain meanires of aU of the relevant school and community 
characteristics. Ihe educatkxi productioa fimction literature has not been tenibly successful in explaining 
differences in achievement cooss schools with school and community variables.* even dwugh there are 
substantial diffexences across schools in the parental and school cliaracteristics that we do observe. (See 
Appendix IWe 1.) One natunlly suspects that there are unobser^ differences among high schools and 
communities that are ifflp(»tani determinants of both education and wages. 

Data from the National Longitudinal Survey of the High School Class of 1972 (NLSHS72) and from a 
matching postsecondary transcript survey (PETS) provide an opportunity to make some progress on this issue. 
The most important feature of the NLSHS72 for presem purposes is that it contains several studenu from a large 
number of high schools (897 high schools in the subsample used in this analysis). This pennits the use of fixed 
effect estimation procedures that control for all observed and unobsttrved charocteristics common to students 
from the same high school. One may also use flxed effects to control for characteristics common to students in 
the same program (Le., academic or nonacademic track) v.ithin a given high school. In addition, the data set 
contains information on parental background, high school curriculum, and test scores. 

Of course, the more one controls for the observed and unobserved factors that lead to differences in 
education, the potentially mart serious the im>blem of measurement einnor in reported educaticm becomes, as 
Griliches (1977. 1979) iias emphasized. ,1 deal with the measurement <tmt issue by using the information on 
education from the postsecondary transcript survey to form instrumenU\l variables QW) for the responses provided 
by the sample membeis. 

The main conclusions of the study are fouri'old. First, controlling for family background leads to a 
substantial reduction in estimates of the rate of return to education. For the combined sample of academic and 
oonacademic track high school students, the ordinary least squares (OLS) estimates imply that the return to a 
year of postsecondary academic educaticHi falls froia 82 percent when one does not control fot family 



* Akin and Oarfinkel (1977), Morgan and Siiageldln (1973), and Johnson and Stafford (1973) 
examine links between ichool quality proxies and labor maricet outcomes. 

' See Hanushek (1986) for a recent survey. 
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background to 6.5 percent when one does. The IV results are similar, suggesting that measurement error is not 
responsible for the reduction. Similar reductions are found among the samples of students who are in high 
school academic programs and among those who are in nonacademic programs. Second, estimates of the rate of 
return to postsecondary academic education for academic and nonacademic ttack high school students are 
remarkably similar. This is true de^ite the £Kt that students from academic programs evn substantially more 
than those from nonacademic programs, even after controlling for observed family background characteristics and 
achievement and aptitude measures. Third, controlling for high school curriculum doeii not have much effect on 
the education coefficients. Fourth, the use of fixed effects to control for high school characteristics has only a 
modest effect on estimates of the rate of return to education. For the combined sample, controlling for these 
factors reduces estimates of the percentage increase in earnings from a year of cdlege by about .5 (e.g., from 6.0 
to 5.5). Hiis suggests that the failure to control for differences in high school variables does not lead to serious 
biases in studies of education and wages. This is an importfint and reassuring finding, because few data sets 
would permit one to control for these fac'tors. 

The paper proceeds as follows: Section 1 presents the wage equation that underlies most of the 
econometric analysis below. It also discusses the potential biases in estimates of the effec; of education on 
wages that may arise from failure to control for characteristics that correlate with educational attainment and that 
affect either the level of the wage holding the educational level constant or the relationship between wages and 
education. Secdon 2 discusses the data used in the study. Section 3 presents estimates of the reuim to 
education. The main conclusions are summarized in Section 4. 

Econometric Framework 

This section discusses sources of bias in estimates of the effect of education on wage rates, and presents 
the economeuic methodology used in the paper. 

Assume that the log real wage is determined by 

(1.1) W^'X^,^X^JB,^X^,^C^^ 

where 

Wa. 8 the log of the real average houriy wage rate of person i from high school h in year t. 
- variables indicating whether the individual is female, black, and/or Hispanic. 
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Xa <■ a Mt of family tjeckground characteristics. 

■ variables for the geographic location of the high school. 
- a vector of tpdtade and achievement measures. 

■ a vector of measures of the high school curriculum taken by person i from high school h. 

■ a measure of postsecondaiy education, such as years of schooling. 
EX^ « the labcff market experience of perscm i in year t. 

2^ B a vector of observed high school and community characteristics. 

The variable (Dai is the composite error component 

(12) ^ik*'^t*\*''*h*^iki 



where 

v'l ■ an index of student- and family-specific factors that affect W indq)endently oi the high school 

and community environment 
Vk ■ the mean in the high school of v'l. 
V| ■ the difference between v\ and v^ for student i. 
m„ ■ an index of high school and community factors that affect w. 

/a« B a transitory error component that is assumed to be uncoirelated with all explanatory variables in 
the wage equation and with the other error components. 

The component V| is uncorrelated with v^ and m^ by construction. 

There are two additional error components in the wage equation. The rate of return to education p f p, 
•»■ varies across individuals and depends on an individual-specific component P( and a high school component 
Pk, where p, and p^ are uncorrelated b) construction and have means of 0, The u.iobservii^ term (pi -f p^) is 
treated as part of the wage equation error in estimation. Below I allow p to u^nd on whether a student is in 
an academic or nonacademic program by estimating separate equations for these groups. 

Suppose one is interested in p, which is the labor market value of an c<;!ra year of education for a 
randomly selected person. Ordinary least squares will lead to biased estimates if the unobserved lactcrs that 

4 
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influeoce W also influence S. S ii likely to be poiitively related to variables that increase the productivity of 
higher education, lower the direct oosti to the student or lower the discount late, or raise the nonpecuniary 
benefits of education. Consequently, one.would expect family background, ability and achievement, course of 
study in high school, and other high school and community flactors to affect not only wage rues but also 
postsecondary schooling, and ^e evidence for the NLSHS72 is that they do (see Altoi^ji (1988)). In flMt, many 
studies of the return to education have few controls for ability, family background, curriculum in high school, 
and other characteristics of the high school and community. Even when one uses as rich a data set as the 
NLSHS72, the flBct that objKrved measures of fsmily background and parental attitudes, student achievement and 
ability, average background and ability of students in the high school, and high school characteristics are 
assumed to influence education and wage rates suggests that the unobserved determinants of education are 
correlated with and (pi -t- pj. This is because the observed measures are likely to be incomplete or 
unreliable. 

In the empirical woric below I systematically add to the wage equation controls for family background, 
curriculum in high school, aptitude vtd achievement, and observed high school characteristics and examine the 
sensitivity of estimates of the return to education to the choice of control variables. Since the NLSHS72 has 
mat than one student in the sample from each high school, I am also able to experiment with the use of a 
separate intercept, or "fixed effect." for each high school as a control for the high school error component 
Vh + n»fc.* 

It is important to keep in mind that the use of high school fixed effects does not solve all of the 
potential sources of endogeneity of the schooling variable. Specifically, it does not eliminate potential bias frum 
the correlation between and the individual error component Vt> or between S^ and the component p( of the 
rate of renun to education. Even after one conuols for observed measures of family background tad for 
aptitude and achievement, unobserved ability differences among students from the same high school may affect 
both S and W. Furthermore, the quality of instruction and peer group experiences of students i»obably varies 
substantially even within a given track within the same high school, and so the fixed effect analysis does not 



' The estimation methods discussed here are standard approaches to estimation of models with an 
error components stnicture. A convenient reference is Hsiao (1986), Chapter 3. The standard errors 
for the OLS and instrumental variables regressions wltli and without high school fixed effects allow 
for a.i)ltrary high school specific foims of heteroscedasticity. serial correlation, and ccneladon across 
students from the same high school. 

S 




control for all high school characteristics that influence ptuticular students.^ However, this study goes limher 
than previous studies by inchiding high school cuiriculuiA controls as well as Camily background and ability 
measures. 

One should also keep in mind that including controls for high school, Camily background, and test scores 
may exacerbate downward bias in the education coefficient arising Crom measurement enar in education. I 
address the measurement error issue by using the independent infonnation in the Post Secondary Transcript 
Survey that accompanied the NLSHS72 to create instrumental variables for the education measures. I use the 
transcript information as instrumental variables rather than as direct measures of education, because the PETS 
survey did not obtain transcripts for all Students who claimed to have attended postsecondaxy schools, in some 
cases because of lack of cooperation by the schools. Consequently, the PETS measure of postsecondaxy 
education wUl also differ ficm acuial schoding. I assume that the measurement errors in the PETS data are 
independent of the infonnation on years of schooling and degree attainment provided by the student.' When 
high school coistants are included in the wage equation, I use deviations of the transcript variables from high 
schod means as the instrumental variables. 

Data 

The NLSHS72 is a Department oi Education survey of individuals who were high school seniors during 
the spring of 1972. Thus, high school dropouts are excluded. Tht individuals were resurveyed in 1973, 1974, 
1976. and 1979. A subsample was also resuiveyed in 1986. I restrict the sample to the 16.683 individuals from 
the schods that participated in the base^year survey. The sample is n;uuc xl to 15.680 by eliminating 
observations with missing high school test information and to 12.980 b> eliminating individuals who did not 
re^xxid to all of the first four folbw-ups. Infonnation from the 1986 follow-up was then added for persons 
who were in the earlier sample of 12,980. 

The cross section-time series of observations for each individual used in the wage analysis was created 
using information on earnings divided by hours for 1977, 1978,.and 1979, and information on the wage at the 



' There is infonnation on tracking in the NLSHS72, and in future woric it would be interesting to 
use a fixed effect to contTol for observed and unobserved characteristics that are common to students 
from the same track in high school 

* Students were asked during each follow-up survey to identify any schools that they were 
attending or had attended. Correlated measurement errors could arise if a student attended college but 
said otherwise. In this case the smdent would not provide the name of the postsecondary school 
attended, and no Q^mscript would be found. I assume that persons do not hide the fact that they 
attended college if they attended college for a signlAcant period of time. 

6 
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beginning ud end of each Job beU between 1980 and 1986, up to a maximum of the four most recent Jobs. An 
observation for 1977 is included if (1) the individual was a ftill*time student neither in October 1976 nor in 
October 1977, (2) the number of hours worked in 1977 was greater than 1,040, and (3) the log of the 1977 real 
wage was between $^ and $75 in 1967 dolUn. Observations for 1978 and 1979 were included if they met the 
com^onding three criteria for 1978 and 1979, req)ectively. Data for begimsicg and ending Job dates (1980- 
1986) were included if (1) the number of hours wcriced in the appropriate year was greater than 1.040, a:td (2) 
the los of the real wage was between $iO and $75 in 1967 dollars. Restriction at the sample to cases with 
nonmissing data on the variables used in the wage analysis reduced the sample size to 38,595 observations on 
9,239 individuals from 897 high schools.' 

The coding of a few of the variables requires discussion. The variable YRSACD79 is the number of 
yean of postsecondary academic education complei'id by 1979. The variable YRSVOC79 is tiie number of years 
of postsecondary vocational education by 1979. To allow for nonlinearities in tiie return to education, a set of 
mutually exclusive education dummy variables were created. VOC79 is 1 if an individual never attended college 
but did attend a postsecondary vocational school, and is 0 otherwise. SOC1479 is 1 if a person has less than two 
years of college (regardless of wheUwr that person also attended vocational school). SOC1579 is 1 if a person 
attended college for two or more years but did not receive a four-year degree. COLL79 is 1 if a person received 
a four-year degree but did not receive an advanced degree. ADV79 is 1 if a person reived a graduate degree. 

The instrumental variables constructed from the PETS survey include a number of transcripts found for 
each student and nine dummy variables for whether students had tiie following transcript combinations: (1) at 
least one transcript, 0) a transcript from a nonacademic institution. (3) a transcript from a two-year public 
academic institution, (4) a transcript from a four-year public academic institution, (5) a transcript from a private 
academic institution. (6) a licence or certificate but no academic degree, (7) an associate degree but no 
bachelor's or advanced drgree, (8) a college degree but no advanced degree, and (9) an advanced degree.'" 



• The subsamplc of siudents in academic pnDgrams comaliis 18,653 pereon-ycar observations from 
858 high schools. The correspondbig figures for tiie nonacademic (general and voc ional traclts) 
subsamplc arc 19,942 and 864. 

The PETS survey contains at least one transcript for 83 percent of the saroi^e members who 
reported some postsecondary education by 1979, for 74.8 percent oi' those who reported vocational 
education or some college but no degree, and for 96.16 percent of Oiose who reported a college or 
advanced degree. Transcript evidence of a college or advanced degree was found for 82.29 perccm of 
the sample members who reported a college or advanced degtec. Tnsnscript evidence of a college or 
advanced degree was found for 3. 16 pen:em of sample who did not report a college or advanced 
degree by 1979. Also, transcript evidence of an advanced degree was found for 8.13 percent of the 
persons who reported college as tiicir highest degree In 1979, which may in part be due to completion 



The control variables for region and city size, family background, aptitude and achievement measures, 
high school curriculum (semester hours in each of eight subjects), and high school chancieristics are listed in the 
footnotes to the tables. Descriptive statistics and variable definitions are provided in Appendix Table 1." To 
save spact, only the education coefficienui are shown in the text tables below. 

EsUfflites of Returns to Edueatloo 
Table 1 presents least squares estimates of the effects of YRSACD79 and Y11SVOC79. Columns 1-4 
do not include dummy variables for each high school, while Columns S-7 do. All equations contain controls for 
race, Hispanic background, sex, experience, txpexitnce squared, time, and time squared. In each column the 
letters R, F, and H indicate whether controls for region and city size (R), family background and achievement 
and aptitude measures (F), and high school curriculum and high school characteristics (H) are included. The 
high school dummy variables absorb the effect of any variables that are constant within the high school, thus, 
region and city size and fixed high school characteristics are implicitly controlled for in Columns S-7. Table 2 
provides a comparable set results, with YRSACD79 and YRSVOC79 replaced by the dummy variables 
VOC79, SCX:i479, SOC1S79, COLL79, and ADV79. The coefficients on the education variables are relative to 
a high schof! graduate, so the coefficient on COLL79 is the difference in the log wages of a college graduate 
and a high school graduate. 

The Returns to Ac&demic Education 

The coefficient on YRSACD79 is .0S17 when only the basic conm>ls are iiKludcd (Column 1) and 
.0790 when one controls for region and city size. This is typical of estimates from other data sets. The 
co^icient Calls to .0653 when family background and ability and aptitude measures are added, a decline of 
.0137. This reduction is consistent with findings of other studies that have used detailed controls for family 



of their advanced degrees after 1979. 

" Appendix Table 1 presents the means and standard deviations of the variables used in the wage 
equations for the full sample, the academic sample, and the nonacademic sample. The table also 
repoits the standard deviation of each variable within a high school, and the firection of the sample 
variance that is across high schools. The standard deviations and the variance decomposition in the 
table refer to the cross sectlon>time scries sample, to which individuals contribute different numbers of 
observations. Consequently, they provide only a rough Indication of the relative impoitance of 
variation within the high school and variation acioss high schools in wages, education, and background 
whantcteristics. See Altoi^i (1988) for a more thorough treatment of this issue. However, the results 
indicate that there is substantial variation across high schools in background characteristics, aptitude 
and achievement measures, and curriculum. Note also that there are substantial differences in the 
means for the academic and nonacademic samples. 
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background and ability or made use of sibling pairs.'^ Adding controls for the student's courses and a set of 
high school characteristics lowen the YRSACD79 coefficient by only ,0009 to .0644. 

A nuyor aim of the present ]?aptt is to determine whether the £aiilure to control for unobserved high 
school and community characteristics leads to a substantial upward bias in estimates of the returns to education. 
The answer seems to be that there is only a small upward bias without these controls. For example, when one 
adds high school dummy variables to the q)Ccification in Column 2, which docs not contain controls for Camily 
background, aptiuide and achievement, cr courses taken, the coefficient on YRSACD79 falls firom .0790 to .0749 
(see Column S). When one controls for background and achievement, the comparable coefficienu without and 
with high school dununy variables are .0633 and .0605. When one ccmtrols for curriculum and observed high 
school characteristics, adding the high school dummy variables reduces the coefficient on YRSACD79 firom 
.0644 to .0598. Thus, failure to control for high school differences leads to an upward bias of .005 in the 
education coefficient, which (multiplying by 100) is an upward bias of .5 in the rate of return to education. 

Similar results are obtained for itiidents from academic and nonacademic programs. The coefficients 
for the two subgroiq)S are remaricably similar. They are also a bit below the coefficients for the combined 
sample. This refiects the ffci that both the wage level and the postsecondary schooling level are positively 
correlated with enrollment in an academic program, even after controlling (os background, aptitude and 
achievement, and semester hours by subject area. Controlling for background and ability has a relatively large 
effect on the education coefficient, while curriculum has only a small efi^ect (compare Columns 2 and 3 and 
Columns 3 and 4). Fbr the academic subgroups, including a sq)arate dummy foi' each high school reduces the 
estimated rate of return by about .6 (compare Columns 2 and 5 or Columns 3 and 6 and multiply by 100). For 
the nonacademic subgroup, the reduction in the rate of reuim is about .2, 

Table 2 reports OLS estimates of the effects of academic education when 'Jie dununy variables VOC79, 
SOC1479. SOC1579. COLL79, and ADV79 are used to parameterize the model. The results are qualitatively 
consistent with those based upon the linear specification in Table 1. 

Instrumental Variables Estimates 

For the combined sample, the use of IV in Table 3 has no effect (to four digits) on the return to 
YRSACD79, when one does not control for family background and test scores, and leads to a sUght reduction 
(relative to OLS) in r«±!;ates of the reoim lu academic education, when one does. Bvtdently, measurement error 



" Sec Grlliches (1979) and Olneck (1979) for discussions of alternative estimates of the return to 
education based on sibling data. 



in the education variable decieases from about .079 with only regional controls to .0S8 when family background 
and test scores are added. There is only a small drop in the IV estimate (from .038 to .056) when high school 
fixed effecu are added to the equation with fiunily background and test scores. This confirms the earlier OLS 
flnding that failure to control for high school, community, or both sets of variables leads to 0;>ly a small bias in 
estimates of the return lo education. 

The use of IV does not significantly change the conclusions based on OLS for the academic and 
nonacadlemic subgroups. Hie IV estimates of models that use five dummy variables for education outcomes 
indicate that controlling for high school makes almost no difference for academic education, if anything, it leads 
to an increase in the estimated return to vocaticxial educational education (sec Table 4). 

The Returns to Vocational Education 

Tables 1 and 3 report OLS and IV estimates of Uie effect of years of vocational education (YRSVOC79) 
on wages for Uie combined sample and Uie academic and nonacademic subgxxxips. The mean of YRSVOC79 is 
^110 for the combined sample and .5031 and .5183 for Uie academic and nonacademic subsamples, respectively. 
For Uie combined sample, Uie OLS results for Uie linear specification indicive a much lower reum for vocational 
education Uian for academic education, wiUi a coefficient of .0145 in Uie absence of controls (Column 1) and 
.0154 when one controls for background, aptitude and achievement, high school curriculum, and Uie high school 
(Column 7). Hte estimates for Uie academic and nonacademic samples are in Uie same general range, but Uie 
small coefficients should be treated cautiously for two reasons. First, vocational education is a very 
heterogenous category, and if programs lasting just a few monUis are coded as lasting a year, a downward bias 
would result. Second, Uieie is evidence Uiat Uie value of vocational education is lower if one has also obtained 
academic postsecondaiy education. I have excluded an interaction term from Uie specifications in Tables 1 and 3 
to simplify interpreting Uie results. However, Uie categorical specification in Tables 2 and 4 excludes persons 
who have boUi vocational and academic education from Uie vocational category. Therefore, Uie mean of VOC79 
is much lower for Uie academic sample Uuui Uie mean for Uie nonacademic sample, despite Uie fact Uiat Uie mean 
of YRSVOC79 is similar for Uie two groups. Uie categorical specification and die combined sample, Uie 
OLS estimates imply Uiat Uie vocational education dummy raises wages by .048 percent to .065 percent, 
depending upon what one controls for. I suspect Uiat Uiere are differences between academic and nonacademic 
students in Uie content of postsecondary vocational education, and Uiat Uiese may underlie Uie larger coefficient 
on VOC79 for Uie academic sample. 

The instrumental variables estimates for YRSVOC79 and VOC79 follow Uic same general pattern as Uie 
OLS estimates, but are imprecise, particularly for Uie academic sample. Some of Uie point estimates for 
YRSVOC79 are acbially negative but not statistically significant. However, for Uie combined sample Uic 
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coefQclent on VOC79 is quite nibstantial (.118) when one controls for the high school, family background, 
cufficulum, and tea scores, although the standard etror is .064. A possible explanation (other than sampling 
enor) is that the returns to vocational programs that are sufficiently well established to lead to a transcript, a 
licence, certificate, or any combination of the three are larger than the returns to other programs. 

Tht impact of controlling for high school and community charuteristics and for family background and 
achievement measures on estinutes of the return to vocational education is sensitive to whetha one uses OLS or 
IV, to the choice of dependent variable, and to whether the student was in an academic or no n acade mi c program 
in high school. I will not discuss the detailed results in the tables. There are a number of possible explanations 
for this, but I do not have the evidence on relative quality of high schools and of students who go on to 
vocational education to sort them ouL 

Conclusion 

The OLS and instrumental variables estimates with high school fixed effects indicate that only modsst 
biases result from the failure of previous studies to control for differences in high schools and for differences in 
primary school and community characteristics common to students from the same high school. This is good 
news fox researchers, because few data sets permit one to study clusters of students from the same high school. 
This paper also confirms previous studies that have found that estimates of the rate of return to education are 
reduced substantially when one controls for family background and aptitude and achievement measures. The 
reduction does not result from measurement error in reported education. 
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Table 1 

EftlBuitcf of the Return to Educatloii 
Dependent variable: log wage 
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controli****' 
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Rra 


R 
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Studenti In Academic Program' 
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.0731 
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.0663 


.0568 
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/ /V\0 1 \ 


(.OOoU) 


Other 

controI^^*'*^ 




R 






R 


RJ» 










Students in Nonacademic Programs* 
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.0563 


.0651 


.0547 
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(.0046) 
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(.0053) 
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(.0065) 


YRSVOC79 


.0196 


.0192 


.0162 
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.0138 


.0121 


.0116 




(.0074) 


(.0075) 


(.0073) 


(.0074) 


(.0078) 


(.0078) 


(.0078) 


Other 

controls^^ 




R 






R 


RJ' 


RJ'»H 



* The combined sample contains 38.595' pcfson-year observations on 9.239 persons from 897 high schools. The 
academic sample contains 18,653 persm-year obsovations on persons bam the academic programs in 858 high 
schools. The nonacademic sample contains 19.942 person-year observations on persons from the vocational a 
general program in 864 high schools. 

^ All equations include BLACK. HISP. CSEX. PEXPIO PEXPIOSQ. YEAR. AND YEARSQ. 

* R - NO.CENTRAL, SOUTH, WEST. SMALLTOWN. MED.CrTY, BIGCFTY, BIOSUBURB, HUOECITY, 
MED.SUBURB3IGSUBURB, HUGESURB, COLLJPROX. 

* F » FATHER^, MOTHER_PD, LOWSES ED.>10NEY MOTHER.WORK BLUECOLF. ENOUSH. 
FATH.COLL, MOTH-COLL. DISCJ>LANS. PAR_INTEREST, PARJNFL. IMPTAVER, 
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If) 



COhLBOEJ^Rm, TEACHER-ASSBSMENT. VOCABULARY. PICTUREMJMB. READING. 
LBnER.OROUP. MATH, MOSAIC.COMP. HOMEWORK. AND dummy vaiiiUes for whether dita were 
missing for FATH.CX)LL. MOIKCOLL. OR BLUBCX)LF. 

' H • BLACKl. BLACK1^ STUAIEACHER. PRIVATE. GRADES. ATTEND JUTES. DROP.RAIE. 
TBACHBR.TURNOVER. TEACHERJ^. SCIENCE.COURSE. HS_SIZE. SCIENCE. FORXANO. 
SOCIALS. ENGLISH. MATH, INDjS^TS. COMMERCIAL. FINE^TS. Summary statistics and definitions 
of all of the variablet are ^ven in Appendix Table 1. 

' Variables that do not vary across high schools, such as the geographic variablt^ 2nd the high school variables 
in footnote c and high school variablet in footnote e. are implicitly controlled for i^' the equations with high 
school dummies. 

■ "White" standard errors in parentheses account for arbitraiy ftnms of beteroscedasticity and coirelation across 
observations on students from a given high schod. 
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